Learning pseudo-Boolean k-DNF and submodular functions
نویسندگان
چکیده
We prove that any submodular function f : {0, 1}n → {0, 1, ..., k} can be represented as a pseudoBoolean 2k-DNF formula. Pseudo-Boolean DNFs are a natural generalization of DNF representation for functions with integer range. Each term in such a formula has an associated integral constant. We show that an analog of Håstad’s switching lemma holds for pseudo-Boolean k-DNFs if all constants associated with the terms of the formula are bounded. This allows us to generalize Mansour’s PAC-learning algorithm for k-DNFs to pseudo-Boolean kDNFs, and hence gives a PAC-learning algorithm with membership queries under the uniform distribution for submodular functions of the form f : {0, 1}n → {0, 1, ..., k}. Our algorithm runs in time polynomial in n, k log , 1/ǫ and log(1/δ) and works even in the agnostic setting. The line of previous work on learning submodular functions [Balcan, Harvey (STOC ’11), Gupta, Hardt, Roth, Ullman (STOC ’11), Cheraghchi, Klivans, Kothari, Lee (SODA ’12)] implies only n query complexity for learning submodular functions in this setting, for fixed ǫ and δ. Our learning algorithm implies a property tester for submodularity of functions f : {0, 1}n → {0, . . . , k} with query complexity polynomial in n for k = O((log n/ log log n)) and constant proximity parameter ǫ. This material is based upon work supported by NSF CAREER award CCF-0845701. Pennsylvania State University, USA. {sofya, grigory}@cse.psu.edu.
منابع مشابه
Efficient Minimization of Higher Order Submodular Functions using Monotonic Boolean Functions
Submodular function minimization is a key problem in a wide variety of applications in machine learning, economics, game theory, computer vision, and many others. The general solver has a complexity of O(n log n.E + nlogn) where E is the time required to evaluate the function and n is the number of variables [32]. On the other hand, many computer vision and machine learning problems are defined...
متن کاملComputing Science WHICH SUBMODULAR FUNCTIONS ARE EXPRESSIBLE USING BINARY SUBMODULAR FUNCTIONS?
Submodular functions occur in many combinatorial optimisation problems and a number of polynomial-time algorithms have been devised to minimise such functions. The time complexity of the fastest known general algorithm for submodular function minimisation (SFM) is O(n6 + n5L), where n is the number of variables and L is the time required to evaluate the function. However, many important special...
متن کاملPersistency for higher-order pseudo-boolean maximization
A pseudo-Boolean function is a function from a 0/1-vector to the reals. Minimizing pseudo-Boolean functions is a very general problem with many applications. In image analysis, the problem arises in segmentation or as a subroutine in task like stero estimation and image denoising. Recent years have seen an increased interest in higher-degree problems, as opposed to quadratic pseudo-Boolean func...
متن کاملInductive Learning from Good Examples
We study what kind of data may ease the computational complexity of learning of Horn clause theories (in Gold's paradigm) and Boolean functions (in PAC-learning paradigm). We give several deenitions of good data (basic and generative representative sets), and develop data-driven algorithms that learn faster from good examples, and degenerate to learn in the limit from the \worst" possible examp...
متن کاملMonotone, Horn and Quadratic Pseudo-Boolean Functions
A pseudo-Boolean function (pBf) is a mapping from f0; 1gn to the real numbers. It is known that pseudo-Boolean functions have polynomial representations, and it was recently shown that they also have disjunctive normal forms (DNFs). In this paper we relate the DNF syntax of the classes of monotone, quadratic and Horn pBfs to their characteristic inequalities.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013